Automatic Detection of Brazil’s Prosodic Tone Unit
نویسندگان
چکیده
This research is focused on the automatic detection of one of the fundamental elements of Brazil’s prosody model, the tone unit. We compared the performance of using silent pause duration alone to delimit tone units and using relative pitch resets and slow pace (or post-boundary lengthening) along with silent pause duration to delimit them. The corpus used for the comparison is composed of 18 highly proficient speakers giving academic lectures in six varieties of English which are representative of the inner (American and British), outer (Indian and South African), and expanding (Chinese and Spanish) concentric circles of Kachru’s World Englishes. The performance was compared by computing Pearson’s correlation between the numbers of tone units in a trained linguist’s transcription of the corpus and the numbers automatically detected by the computer. The computer detected the tone units from phone sequences identified in the audio files by a large vocabulary spontaneous speech recognition (LVCSR) program. We found including relative pitch resets and slow pace along with silent pause duration in the computer algorithm improved the correlation between the numbers of tone units in the linguist’s transcription of the corpus and the numbers automatically detected by the computer from 0.935 to 0.959.
منابع مشابه
Apa: an Object Oriented System for Automatic Prosodic Analysis
.....................................................................................................................................................7 List of Figures ...........................................................................................................................................9 List of Tables .............................................................................
متن کاملAPA: towards an Automatic Tool for Prosodic Analysis
In this paper a tool for the speech signal prosodic analysis is described. The system APA (Automatic Prosodic Analysis) is based on a tool for speech segmentation into syllabic units and on their description in terms of pitch, energy and duration. A particular linear stylization of the fundamental frequency function is proposed, which helps in describing efficiently intonation movements at phra...
متن کاملProsodic Structure Representation for Boundary Detection in Spontaneous French
Automatic speech processing has recently turned to the treatment of continuous spontaneous speech, which demands, among many other issues, a representation of its prosodic organization. This paper presents a new approach to automatic prosodic boundary detection and prosodic unit structuring, based, with certain changes, on a descriptive theory of the French prosodic system initially proposed fo...
متن کاملKorean MULTEXT: A Korean Prosody Corpus
This paper describes the contents of the Korean prosody corpus (Korean MULTEXT), which is a Korean version of the speech database Eurom1. The corpus consists of about 2 hours of read speech, transcribed primarily in orthography (in Korean alphabet and in a Romanized transcription), in IPA and in SAMPA. Furthermore, it includes the original F0 values, stylized F0 values extracted using Momel, an...
متن کاملEvaluation of Automatic Generation of Prosody with a Superposition Model
A new paradigm for modelling prosody is introduced. We assume that global melodic prototypes are built and stored in a "prosodic lexicon". The actual generation of adequate prosodic contours is achieved by retrieving and combining these elementary global contours accessed by linguistic keys. Two automatic F0 generation procedures have been used: The first consists of a structured lexicon, the s...
متن کامل